Generalized Speedy Q-Learning

نویسندگان

چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speedy Q-Learning

We introduce a new convergent variant of Q-learning, called speedy Q-learning, in order to address the problem of slow convergence in the standard form of the Q-learning algorithm. We prove a PAC bound on the performance of SQL, which shows that only T = O ( log(1/δ)ǫ(1 − γ) ) steps are required for the SQL algorithm to converge to an ǫ-optimal action-value function with high probability. This ...

متن کامل

Speedy Q-Learning: A Computationally Efficient Reinforcement Learning Algorithm with a Near-Optimal Rate of Convergence∗

We consider the problem of model-free reinforcement learning (RL) in the Markovian decision processes (MDP) under the probably approximately correct (PAC) model. We introduce a new variant of Q-learning, called speedy Q-learning (SQL), to address the problem of the slow convergence in the standard Q-learning algorithm, and prove PAC bounds on the performance of this algorithm. The bounds indica...

متن کامل

PRESENTING GENERALIZED q - SCHUR

متن کامل

Generalized Q-functions

The modulus squared of a class of wave functions defined on phase space is used to define a generalized family of Q or Husimi functions. A parameter λ specifies orderings in a mapping from the operator |ψ〉〈σ| to the corresponding phase space wave function, where σ is a given fiducial vector. The choice λ = 0 specifies the Weyl mapping and the Q-function so obtained is the usual one when |σ〉 is ...

متن کامل

GENERALIZED q - FIBONACCI NUMBERS

We introduce two sets of permutations of {1, 2, . . . , n} whose cardinalities are generalized Fibonacci numbers. Then we introduce the generalized q-Fibonacci polynomials and the generalized q-Fibonacci numbers (of first and second kind) by means of the major index statistic on the introduced sets of permutations.

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Control Systems Letters

سال: 2020

ISSN: 2475-1456

DOI: 10.1109/lcsys.2020.2970555